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L AMENDMENTS 

In the specification: 

Please replace the paragraph beginning on page 4, line 13 with the following rewritten 
paragraph: 

;| Thus, provided herein is a zinc finger protein that binds to a target site, said zinc finger 

protein comprising a first (Fl), a second (F2), and a third (F3) zinc finger, ordered Fl, F2, F3 
from N-terminus to C-terminus, said target site comprising, in 3' to 5' direction, a first (SI), a 
second (S2), and a third (S3) target subsite, each target subsite having the nucleotide sequence 
GNN, wherein if SI comprises GAA, Fl comprises the amino acid sequence QRSNLVR (SEQ 
ID NO: 158); if S2 comprises GAA, F2 comprises the amino acid sequence QSGNLAR (SEQ ID 
NO:801); if S3 comprises GAA, F3 comprises the amino acid sequence QSGNLAR (SEQ ID 
NO:801); if SI comprises GAG, Fl comprises the amino acid sequence RSDNLAR (SEQ ID 
NO: 130); if S2 comprises GAG, F2 comprises the amino acid sequence RSDNLAR (SEQ ID 
NO: 130); if S3 comprises GAG, F3 comprises the amino acid sequence RSDNLTR (SEQ ID 
NO:231); if SI comprises GAC, Fl comprises the amino acid sequence DRSNLTR (SEQ ID 
NO:395); if S2 comprises GAC, F2 comprises the amino acid sequence DRSNLTR (SEQ ID 
NO:395); if S3 comprises GAC, F3 comprises the amino acid sequence DRSNLTR (SEQ ID 
NO:395); if SI comprises GAT, Fl comprises the amino acid sequence QSSNLAR (SEQ ID 
NO: 1765); if S2 comprises GAT, F2 comprises the amino acid sequence TSGNLVR (SEQ ID 
NO: 1442); if S3 comprises GAT, F3 comprises the amino acid sequence TSANLSR (SEQ ID 
NO:377); if SI comprises GGA, Fl comprises the amino acid sequence QSGHLAR (SEQ ID 
NO:413); if S2 comprises GGA, F2 comprises the amino acid sequence QSGHLQR (SEQ ID 
NO:287); if S3 comprises GGA, F3 comprises the amino acid sequence QSGHLQR (SEQ ID 
NO:287); if SI comprises GGG, Fl comprises the amino acid sequence RSDHLAR (SEQ ID 
NO: 127); if S2 comprises GGG, F2 comprises the amino acid sequence RSDHLSR (SEQ ID 
NO:229); if S3 comprises GGG, F3 comprises the amino acid sequence RSDHLSR (SEQ ID 
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NO:229); if SI comprises GGC, Fl comprises the amino acid sequence DRSHLRT (SEQ ID 
NO: 1506); if S2 comprises GGC, F2 comprises the amino acid sequence DRSHLAR (SEQ ID 
NO: 1092); if SI comprises GGT, Fl comprises the amino acid sequence QSSHLTR (SEQ ID 
NO:835); if S2 comprises GGT, F2 comprises the amino acid sequence TSGHLSR (SEQ ID 
NO: 1201); if S3 comprises GGT, F3 comprises the amino acid sequence TSGHLVR (SEQ ID 
NO: 1425); if SI comprises GCA, Fl comprises the amino acid sequence QSGSLTR (SEQ ID 
NO:342); if S2 comprises GCA, F2 comprises QSGDLTR (SEQ ID NO:220); if S3 comprises 
GCA, F3 comprises QSGDLTR (SEQ ID NO:220); if SI comprises GCG, Fl comprises the 
amino acid sequence RSDDLTR (SEQ ID NO: 188); if S2 comprises GCG, F2 comprises the 
amino acid sequence RSDDLQR (SEQ ID NO: 1844); if S3 comprises GCG, F3 comprises the 
amino acid sequence RSDDLTR (SEQ ID NO: 188); if SI comprises GCC, Fl comprises the 
amino acid sequence ERGTLAR (SEQ ID NO: 131); if S2 comprises GCC, F2 comprises the 
amino acid sequence DRSDLTR (SEQ ID NO:417); if S3 comprises GCC, F3 comprises the 
amino acid sequence DRSDLTR (SEQ ID NO:417); if SI comprises GCT, Fl comprises the 
amino acid sequence QSSDLTR (SEQ ID NO: 1450); if S2 comprises GCT, F2 comprises the 
amino acid sequence QSSDLTR (SEQ ID NO: 1450); if S3 comprises GCT, F3 comprises the 
amino acid sequence QSSDLQR (SEQ ID NO:132); if SI comprises GTA, Fl comprises the 
amino acid sequence QSGALTR (SEQ ID NO: 1398); if S2 comprises GTA, F2 comprises the 
amino acid sequence QSGALAR (SEQ ID NO:3339); if SI comprises GTG, Fl comprises the 
amino acid sequence RSDALTR (SEQ ID NO: 153); if S2 comprises GTG, F2 comprises the 
amino acid sequence RSDALSR (SEQ ID NO:237); if S3 comprises GTG, F3 comprises the 
amino acid sequence RSDALTR (SEQ ID NO:153); if SI comprises GTC, Fl comprises the 
amino acid sequence DRSALAR (SEQ ID NO:184); if S2 comprises GTC, F2 comprises the 
amino acid sequence DRSALAR (SEQ ID NO: 184); and if S3 comprises GTC, F3 comprises the 
amino acid sequence DRSALAR (SEQ ID NO:184).-|-' — ~* 



Please replace the paragraph beginning on page 5, line 23 with the following rewritten 
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paragraph: 

m \ Also provided are methods of designing a zinc finger protein comprising a first (Fl), 



second (F2), and a third (F3) zinc finger, ordered Fl, F2, F3 from N-terminus to C-terminus that 
binds to a target site comprising, in 3' to 5' direction, a first (SI), a second (S2), and a third (S3) 
target subsite, each target subsite having the nucleotide sequence GNN, the method comprising 
the steps of (a) selecting the Fl zinc finger such that it binds to the SI target subsite, wherein if 
SI comprises GAA, Fl comprises the amino acid sequence QRSNLVR (SEQ ID NO:158); if SI 
comprises GAG, Fl comprises the amino acid sequence RSDNLAR (SEQ ID NO: 130); if SI 
comprises GAC, Fl comprises the amino acid sequence DRSNLTR (SEQ ID NO:395); if SI 
comprises GAT, Fl comprises the amino acid sequence QSSNLAR (SEQ ID NO: 1765); if SI 
comprises GGA, Fl comprises the amino acid sequence QSGHLAR (SEQ ID NO:413); if SI 
comprises GGG, Fl comprises the amino acid sequence RSDHLAR (SEQ ID NO: 127); if SI 
comprises GGC, Fl comprises the amino acid sequence DRSHLRT (SEQ ID NO: 1506); if SI 
comprises GGT, Fl comprises the amino acid sequence QSSHLTR (SEQ ID NO:835); if SI 
comprises GCA, Fl comprises QSGSLTR (SEQ ID NO:342); if SI comprises GCG, Fl 
comprises RSDDLTR (SEQ ID NO:188); if S2 comprises GCG, F2 comprises RSDDLQR (SEQ 
ID NO:1844); if SI comprises GCC, Fl comprises ERGTLAR (SEQ ID NO:131); if SI 
comprises GCT, Fl comprises the amino acid sequence QSSDLTR (SEQ ID NO: 1450); if SI 
comprises GTA, Fl comprises the amino acid sequence QSGALTR (SEQ ID NO: 1398); if SI 
comprises GTG, Fl comprises the amino acid sequence RSDALTR (SEQ ID NO:153); if SI 
comprises GTC, Fl comprises the amino acid sequence DRSALAR (SEQ ID NO: 184); (b) 
selecting the F2 zinc finger such that it binds to the S2 target subsite, wherein S2 comprises 
GAA, F2 comprises the amino acid sequence QSGNLAR (SEQ ID NO:801); if S2 comprises 
GAG, F2 comprises the amino acid sequence RSDNLAR (SEQ ID NO: 130); if S2 comprises 
GAC, F2 comprises the amino acid sequence DRSNLTR (SEQ ID NO:395); if S2 comprises 
GAT, F2 comprises the amino acid sequence TSGNLVR (SEQ ID NO: 1442); if S2 comprises 
GGA, F2 comprises the amino acid sequence QSGHLQR (SEQ ID NO:287); if S2 comprises 
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GGG, F2 comprises the amino acid sequence RSDHLSR (SEQ ID NO:229); if S2 comprises 
GGC, F2 comprises the amino acid sequence DRSHLAR (SEQ ID NO: 1 092); if S2 comprises 
GGT, F2 comprises the amino acid sequence TSGHLSR (SEQ ID NO: 1201); if S2 comprises 
GCA, F2 comprises the amino acid sequence QSGDLTR (SEQ ID NO:220); if S2 comprises 
GCC, F2 comprises the amino acid sequence DRSDLTR (SEQ ID NO:417); if S2 comprises 
GCT, F2 comprises the amino acid sequence QSSDLTR (SEQ ID NO: 1450); if S2 comprises 
GTA, F2 comprises the amino acid sequence QSGALAR (SEQ ID NO:3339); if S2 comprises 
GTG, F2 comprises the amino acid sequence RSDALSR (SEQ ID NO:237); if S2 comprises 
$\ GTC, F2 comprises the amino acid sequence DRS ALAR (SEQ ID NO: 1 84); and (c) selecting the 
F3 zinc finger such that it binds to the S3 target subsite, wherein if S3 comprises GAA, F3 
comprises the amino acid sequence QSGNLAR (SEQ ID NO:801); if S3 comprises GAG, F3 
comprises the amino acid sequence RSDNLTR (SEQ ID NO:231); if S3 comprises GAC, F3 
comprises the amino acid sequence DRSNLTR (SEQ ID NO:395); if S3 comprises GAT, F3 
comprises the amino acid sequence TSANLSR (SEQ ID NO:377); if S3 comprises GGA, F3 
comprises the amino acid sequence QSGHLQR (SEQ ID NO:287); if S3 comprises GGG, F3 
comprises RSDHLSR (SEQ ID NO:229); if S3 comprises GGT, F3 comprises the amino acid 
sequence TSGHLVR (SEQ ID NO:1425); if S3 comprises GCA, F3 comprises the amino acid 
sequence QSGDLTR (SEQ ID NO:220); if S3 comprises GCG, F3 comprises the amino acid 
sequence RSDDLTR (SEQ ID NO:188); if S3 comprises GCC, F3 comprises the amino acid 
sequence DRSDLTR (SEQ ID NO:417); if S3 comprises GCT, F3 comprises the amino acid 
sequence QSSDLQR (SEQ ID NO:132); if S3 comprises GTG, F3 comprises RSDALTR (SEQ 
ID NO: 153); and if S3 comprises GTC, F3 comprises the amino acid sequence DRS ALAR (SEQ 
ID NO: 184); 

thereby designing a zinc finger protein that binds to a target site. -j- ~~ 



Please replace the paragraph beginning on page 7, line 5 with the following rewritten 
paragraph: 
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.4 In certain embodiments of the zinc finger proteins and methods described herein, SI 

comprises GAA and Fl comprises the amino acid sequence QRSNLVR (SEQ ID NO: 158). In 
other embodiments, S2 comprises GAA and F2 comprises the amino acid sequence QSGNLAR 
(SEQ ID NO:801). In other embodiments, S3 comprises GAA and F3 comprises the amino acid 
sequence QSGNLAR (SEQ ID NO:801). In other embodiments, SI comprises GAG and Fl 
comprises the amino acid sequence RSDNLAR (SEQ ID NO: 130). In other embodiments, S2 
comprises GAG and F2 comprises the amino acid sequence RSDNLAR (SEQ ID NO: 130). In 
other embodiments, S3 comprises GAG and F3 comprises the amino acid sequence RSDNLTR 
(SEQ ID NO:231). In other embodiments, SI comprises GAC and Fl comprises the amino acid 
sequence DRSNLTR (SEQ ID NO:395). In other embodiments, S2 comprises GAC and F2 
comprises the amino acid sequence DRSNLTR (SEQ ID NO:395). In other embodiments, S3 
comprises GAC and F3 comprises the amino acid sequence DRSNLTR (SEQ ID NO:395). In 
other embodiments, SI comprises GAT and Fl comprises the amino acid sequence QSSNLAR 
(SEQ ID NO: 1765). In other embodiments, S2 comprises GAT and F2 comprises the amino acid 
sequence TSGNLVR (SEQ ID NO: 1442). In other embodiments, S3 comprises GAT and F3 
comprises the amino acid sequence TSANLSR (SEQ ID NO:377). In other embodiments, SI 
comprises GGA and Fl comprises the amino acid sequence QSGHLAR (SEQ ID NO:413). In 
other embodiments, S2 comprises GGA and F2 comprises the amino acid sequence QSGHLQR 
(SEQ ID NO:287). In other embodiments, S3 comprises GGA and F3 comprises the amino acid 
sequence QSGHLQR (SEQ ID NO:287). In other embodiments, SI comprises GGG and Fl 
comprises the amino acid sequence RSDHLAR (SEQ ID NO: 127). In other embodiments, S2 
comprises GGG and F2 comprises the amino acid sequence RSDHLSR (SEQ ID NO:229). In 
other embodiments, S3 comprises GGG and F3 comprises the amino acid sequence RSDHLSR 
(SEQ ID NO:229). In other embodiments, SI comprises GGC and Fl comprises the amino acid 
sequence DRSHLTR (SEQ ID NO:705). In other embodiments, S2 comprises GGC and F2 
comprises the amino acid sequence DRSHLAR (SEQ ID NO: 1092). In other embodiments, SI 
comprises GGT and Fl comprises the amino acid sequence QSSHLTR (SEQ ID NO:835). In 
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other embodiments, S2 comprises GGT and F2 comprises the amino acid sequence TSGHLSR 
(SEQ ID NO: 1201). In other embodiments, S3 comprises GGT and F3 comprises the amino acid 
sequence TSGHLVR (SEQ ID NO: 1425). In other embodiments, SI comprises GCA and Fl 
comprises the amino acid sequence QSGSLTR (SEQ ID NO:342). In other embodiments, S2 
comprises GCA and F2 comprises the amino acid sequence QSGDLTR (SEQ ID NO:220). In 
other embodiments, S3 comprises GCA and F3 comprises the amino acid sequence QSGDLTR 
(SEQ ID NO:220). In other embodiments, SI comprises GCG and Fl comprises the amino acid 
sequence RSDDLTR (SEQ ID NO: 1 88). In other embodiments, S2 comprises GCG and F2 
i comprises the amino acid sequence RSDDLQR (SEQ ID NO: 1 844). In other embodiments, S3 
comprises GCG and F3 comprises the amino acid sequence RSDDLTR (SEQ ID NO: 188). In 
other embodiments, SI comprises GCC and Fl comprises the amino acid sequence ERGTLAR 
(SEQ ID NO: 131). In other embodiments, S2 comprises GCC and F2 comprises the amino acid 
sequence DRSDLTR (SEQ ID NO:417). In other embodiments, S3 comprises GCC and F3 
comprises the amino acid sequence DRSDLTR (SEQ ID NO:417). In other embodiments, SI 
comprises GCT and Fl comprises the amino acid sequence QSSDLTR (SEQ ID NO: 1450). In 
other embodiments, S2 comprises GCT and F2 comprises the amino acid sequence QSSDLTR 
(SEQ ID NO: 1450). In other embodiments, S3 comprises GCT and F3 comprises the amino acid 
sequence QSSDLQR (SEQ ID NO: 132). In other embodiments, SI comprises GTA and Fl 
comprises the amino acid sequence QSGALTR (SEQ ID NO: 1398). In other embodiments, S2 
comprises GTA and F2 comprises the amino acid sequence QSGALAR (SEQ ID NO:3339). In 
other embodiments, SI comprises GTG and Fl comprises the amino acid sequence RSDALTR 
(SEQ ID NO: 153). In other embodiments, S2 comprises GTG and F2 comprises the amino acid 
sequence RSDALSR (SEQ ID NO:237). In other embodiments, S3 comprises GTG and F3 
comprises the amino acid sequence RSDALTR (SEQ ID NO: 153). In other embodiments, SI 
comprises GTC and Fl comprises the amino acid sequence DRSALAR (SEQ ID NO: 184). In 
other embodiments, S2 comprises GTC and F2 comprises the amino acid sequence DRSALAR 
(SEQ ID NO: 184). In other embodiments, S3 comprises GTC and F3 comprises the amino acid 

7 



/"I 



9-' 



AttyDktNo. 8325-0011.21 
S11-US3 
09/990,186 



sequence DRSALAR (SEQ ID NO:184).-|-" 



Please replace the paragraph beginning on page 9, line 10 with the following rewritten 
paragraph: 

A Figure 1 shows results of site selection analysis of two representative zinc finger 



proteins (leftmost 4 columns) and measurements of binding affinity for each of these proteins to 
their intended target sequences and to variant target sequences, (rightmost 3 columns). Analysis 
^ of ZFP1 is shown in the upper portion of the figure and analysis of ZFP2 is shown in the lower 

portion of the figure. For the site selection analyses, the amino acid sequences of residues -1 
through +6 of the recognition helix of each of the three component zinc fingers (F3 (ZFP1, SEQ 
ID NO:130; ZFP2, SEQ ID NO:420), F2 (ZFP1, SEQ ID NO.1051; ZFP2, SEQ ID NO:889) and 
Fl (ZFP1, SEQ ID NO:395; ZFP2, SEQ ID NO:685)) are shown across the top row; the intended 
target sequence (divided into finger-specific target subsites) is shown across the second row, and 
a summary of the sequences bound is shown in the third row. Data for F3 is shown in the second 
column, data for F2 is shown in the third column, and data for Fl is shown in the third column.- ^" 

Please replace the paragraph beginning on page 9, line 24 with the following rewritten 
paragraph: 

-f Figure 2 shows amino acid sequences of zinc finger recognition regions (amino acids 



-1 through +6 of the recognition helix) that bind to each of the 16 GNN triplet subsites. Three 
amino acid sequences are shown for each trinucleotide subsite; these correspond to optimal 
amino acid sequences for recognition of the subsite from each of the three positions (finger 1, 
F1(SEQ ID NOS:688, 2534, 676, 1769, 342, 1450, 131, 158, 1765, 395, 1407, 2644, 705, 1398, 
1733 & 184); finger 2, F2 (SEQ ID NOS:688, 229, 1446, 3051, 220, 1450, 417, 801, 1442, 395, 
1824, 1201, 972, 3339, 1 151 & 184); or finger 3, F3 (SEQ ID NOS:943, 229, 676, 1769, 220, 
1365, 417, 801, 3525, 395, 1824, 1425, 972, 3592, 952 & 184)) in a three-finger zinc finger ' 
protein. Amino acid sequences are from N-terminal to C-terminal; nucleotide sequences are 
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from 5' to 3\\- 



Please replace the paragraph beginning on page 10, line 1 with the following rewritten 
paragraph: 

•(- Also shown are site selection results for each of the 48 position-dependent GNN- 
recognizing zinc fingers. These show the number of times a particular nucleotide was present, at 
a given position, in a collection of oligonucleotide sequences bound by the finger. For example, 
out of 15 oligonucleotides bound by a zinc finger protein with the amino acid sequence 
QSGHLAR (SEQ ID NO:413) present at the finger 1 (Fl) position, 15 contained a G in the 5'- 
most position of the subsite, 15 contained a G in the middle position of the subsite, while, at the 
3 '-most position of the subsite, 10 contained an A, 3 contained a G and 2 contained a T. 
Accordingly, this particular amino acid sequence is optimal for binding a GGA triplet from the 
Fl position. |- ~~ ~ — — 

Please replace the paragraph beginning on page 10, line 10 with the following rewritten 
paragraph: 

J- Figures 3A, 3B and 3C show site selection data indicating positional dependence of 

GCA-, GAT- and GGT-binding zinc fingers. The first and fourth (where applicable) rows of 
each figure show portions of the amino acid sequence of a designed zinc finger protein (Fl 
column, SEQ ID NOS:220, 1765, 1442, 835 & 1425; F2 column, SEQ ID NOS:220, 1765, 1442, 
889, 1425; F3 column, SEQ ID NOS:220, 159, 377, 889 & 1425). Amino acid residues-1 
through +6 of each a-helix are listed from left to right. The second and fifth (where applicable) 
rows show the target sequence, divided into three triplet subsites, one for each finger of the 
protein shown in the first and fourth (where applicable) rows, respectively. The third and sixth 
(where applicable) rows show the distribution of nucleotides in the oligonucleotides obtained by 
site selection with the proteins shown in the first and fourth (where applicable) rows, 
respectively. Figure 3A shows data for fingers designed to bind GCA; Figure 3B shows data for 



/ 



AttyDktNo. 8325-0011.21 
S11-US3 
09/990,186 
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fingers designed to bind GAT; Figure 3C shows data for fingers designed to bind GGT. 4 

Please replace the paragraph beginning on page 10, line 21 with the following rewritten 
paragraph: 

_-f Figures 4A and 4B show properties of the engineered ZFP EP2C. Figure 4A shows 



site selection data. The first row provides the amino acid sequences (F3, SEQ ID NO: 1 100; F2 
SEQ ID NO:237; Fl, SEQ ID NO:1450) of residues -1 through +6 of the recognition helices for 
each of the three zinc fingers of the EP2C protein. The second row shows the target sequence (5' 
to 3'); with the distribution of nucleotides in the oligonucleotides obtained by site selection 
indicated below the target sequence.|-~~ " ~ 



Please replace the paragraph beginning on page 13, line 19 with the following rewritten 
paragraph: 

-(- A D-able subsite within a target site has the motif 5'NNGK3' (SEQ ID NO:4084). A 



target site containing one or more such motifs is sometimes described as a D-able target site. A 
zinc finger appropriately designed to bind to a D-able subsite is sometimes referred to as a D- 
able finger. Likewise a zinc finger protein containing at least one finger designed or selected to 
bind to a target site including at least one D-able subsite is sometimes referred to as a D-able zinc 
finger protein, f *~ — ■ 



Please replace the paragraph beginning on page 14, line 16 with the following rewritten 
paragraph: 

| Table 6 lists a collection of consensus sequences for zinc fingers and the target sites 



e amino 



bound by such sequences. Conventional one letter amino acid codes are used to designat 
acids occupying consensus positions. The symbol "X" designates a nonconsensus position that 
can in principle be occupied by any amino acid. In most zinc fingers of the C 2 H 2 type, binding 
specificity is principally conferred by residues -1, +2, +3 and +6. Accordingly, 



consensus 
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sequence determining binding specificity typically include at least these residues. Consensus 
sequences are useful for designing zinc fingers to bind to a given target sequence. Residues 
occupying other positions can be selected based on sequences in Tables 1-5, or other known zinc 
finger sequences. Alternatively, these positions can be randomized with a plurality of candidate 
amino acids and screened against one or more target sequences to refine binding specificity or 
improve binding specificity. In general, the same consensus sequence can be used for design of a 
zinc finger regardless of the relative position of that finger in a multi-finger zinc finger protein. 
For example, the sequence RXDNXXR (SEQ ID NO:4060) can be used to design a N-terminal, 
central or C-terminal finger of three finger protein. However, some consensus sequences are 
most suitable for designing a zinc finger to occupy a particular position in a multi-finger protein. 
For example, the consensus sequence RXDHXXQ (SEQ ID NO:4055) is most suitable for 
designing a C-terminal finger of a three-finger protein.| " 



Please replace the paragraph beginning on page 32, line 23 with the following rewritten 
paragraph: 

\ The amino acid sequence RSDXLXR (SEQ ID NO:4085) (position -1 to +6 of the 



recognition helix) was found to be optimal for binding to the four GNG triplets, with Asn +3 
specifying A as the middle nucleotide; His +3 specifying G as the middle nucleotide; Ala +3 
specifying T as the middle nucleotide; and Asp +3 specifying cytosine as the middle nucleotide. 
At the +5 position, Ala, Thr, Ser, and Gin, were tested, and all showed similar specificity profiles 
by site selection. Interestingly, and in contrast to a previous report (Swirnoff et al. (1995) Mol. 
Cell. Biol. 15:2275-2287), site selection results indicated that three naturally-occurring GCG- 
binding fingers from zif268 and Spl, having the amino acid sequences RSDELTR (SEQ ID 
NO:123), RSDELQR (SEQ ID NO:302), and RSDERKR (SEQ ID NO:l 100), were not GCG- 
specific. Rather, each of these fingers selected almost equal numbers of GCG and GTG 
sequences. Analysis of binding affinity by gel-shift experiments confirmed that finger 3 of 
zif268, having the sequence RSDERKR (SEQ ID NO: 1 100), binds GCG and GTG with 
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f _ _ 

py*** approximately equal affinity, f ~ " 

Please replace the paragraph beginning on page 33, line 5 with the following rewritten 
paragraph: 

|- Based on existing design rules, the amino acid sequence QSGDLTR (SEQ ID NO:220) 
(-1 through +6) was tested for its ability to bind the GCA triplet from three positions (Fl, F2, and 
F3) within a three-finger ZFP. Figure 3A shows that the QSGDLTR (SEQ ID NO:220) sequence 
bound preferentially to the GCA triplet subsite from the F2 and F3 positions, but not from Fl . 
V* In fact, the presence of QSGDLTR (SEQ ID NO:220) at the Fl position of three different three- 

finger ZFPs resulted predominantly in selection of GCT. Accordingly, an attempt was made to 
redesign this sequence to obtain specificity for GCA from the Fl position. Since the sequence 
q-i g+ 2 S+ 3 R+ 6 ( SE q ID NO:4065) had previously been selected from a randomized Fl library 
using GCA as target (Rebar et al. (1994) Science 263:671-673), a D (asp) to S (ser) change was 
made at the +3 residue of this finger. The resulting sequence, QSGSLTR (SEQ ID NO:342), was 
tested for its binding specificity by site selection and found to preferentially bind GCA, from the 
Fl position, in three different ZFPs (see Figure 2).-|-~ " — " 



Please replace the paragraph beginning on page 33, line 17 with the following rewritten 
paragraph: 

|- The QSGSLTR (SEQ ID NO:342) zinc finger, optimized for recognition of the GCA 

subsite from the Fl position, was tested for its selectivity when located at the F2 position. 
^ Accordingly, two ZFPs, one containing QSGSLTR (SEQ ID NO:342) at finger 2 and one 

^ containing QSGDLTR (SEQ ID NO:220) at finger 2 (both having identical Fl sequences and 

identical F3 sequences) were tested by site selection. The results indicated that, when used at the 
F2 position, QSGSLTR (SEQ ID NO:342) bound preferentially to GTA, rather than GCA. Thus, 
for optimal binding of a GCA triplet subsite from the Fl position, the amino acid sequence 
QSGSLTR (SEQ ID NO:342) is required; while, for optimal binding of the same subsite 
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sequence from F2 or F3, QSGDLTR (SEQ ID NO:220) should be used. Accordingly, different 



^ zinc finger acid sequences may be needed tQ spedfy & particular 



sequence, 



depending upon the location of the subsite within the target sequence and, hence, upon the 



position of the finger in the protein.J-''" 



Please replace the paragraph beginning on page 33, line 29 with the following rewritten 
paragraph: 

,| Positional effects were also observed for zinc fingers recognizing GAT and GGT 



subsites. The zinc finger amino acid sequence QSSNLAR (SEQ ID NO:1765) (-1 through +6) is 
expected to bind to GAT, based on design rules. However, this sequence selected GAT only 
from the Fl position, and not from the F2 and F3 positions, from which the sequence GAA was 
preferentially bound (Figure 3B). Similarly, the amino acid sequence QSSHLTR (SEQ ID 
<j\ NO:835) which, based on design rules, should bind GGT, selected GGT at the Fl position, but 
not at the F2 and F3 positions, from which it preferentially bound GGA (Figure 3C). 
Conversely, the amino acid sequence TSGHLVR (SEQ ID NO: 1425) has previously been 
disclosed to recognize the triplet GGT, based on its selection from a randomized library of zif268 
finger 2. U.S. Patent No. 6,140,081. However, TSGHLVR (SEQ ID NO:1425) was not specific 
for the GGT subsite when located at the Fl position (Figure 3C). These results indicate that the 
binding specificity of many fingers is position dependent, and particularly point out that the 
sequence specificity of a zinc finger selected from a F2 library may be positionally limited! 



13 



AttyDktNo. 8325-0011.21 
S11-US3 
09/990,186 



Please replace the table on page 66 with the follo wing amended table: 

A TABLE 6 
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In the claim: 



1 . (Amended) A method for designing a zinc finger protein comprising a first (Fl), a second 
(F2), and a third (F3) zinc finger, ordered Fl, F2, F3 from N-terminus to C-terminus that binds to 
a target site comprising, in 3' to 5' direction, a first (SI), a second (S2), and a third (S3) target 
subsite, each target subsite having the nucleotide sequence GNN, the method comprising the steps 
of 

selecting theFl zinc finger such that it binds to the SI target subsite, wherein if S 1 comprises 
GAA, F 1 comprises the amino acid sequence QRSNLVR (SEQ ID NO: 1 58); if S 1 comprises GAG, 
Fl comprises the amino acid sequence RSDNLAR (SEQ ID NO.130); if SI comprises GAC, Fl 
comprises the amino acid sequence DRSNLTR (SEQ ID NO:395); if SI comprises GAT, Fl 
comprises the amino acid sequence QSSNLAR (SEQ ID NO:1765); if SI comprises GGA, Fl 
comprises the amino acid sequence QSGHLAR (SEQ ID NO:413); if SI comprises GGG, Fl 
comprises the amino acid sequence RSDHLAR (SEQ ID NO:127); if SI comprises GGC, Fl 
comprises the amino acid sequence DRSHLTR (SEQ ID NO:1506); if SI comprises GGT, Fl 
comprises the amino acid sequence QSSHLTR (SEQ ID NO:835); if SI comprises GCA, Fl 
comprises QSGSLTR (SEQ ID NO:342); if S 1 comprises GCG, F 1 comprises RSDDLTR (SEQ ID 

NO:188);ifSl comprises GCC.F1 comprises ERGTLAR(SEQ ID NO:131);ifSl comprises GCT, 
Fl comprises the amino acid sequence QSSDLTR (SEQ ID NO:1450); if SI comprises GTA, Fl 
comprises the amino acid sequence QSGALTR (SEQ ID NO:1398); if SI comprises GTG, Fl 
comprises the amino acid sequence RSDALTR (SEQ ID NO: 153); if SI comprises GTC, Fl 
comprises the amino acid sequence DRSALAR (SEQ ID NO: 184); 

selecting the F2 zinc finger such that it binds to the S2 target subsite, wherein S2 comprises 
GAA, F2 comprises the amino acid sequence QSGNLAR (SEQ ID NO:801); if S2 comprises GAG, 
F2 comprises the amino acid sequence RSDNLAR (SEQ ID NO: 130); if S2 comprises GAC, F2 
comprises the amino acid sequence DRSNLTR (SEQ ID NO:395); if S2 comprises GAT, F2 
comprises the amino acid sequence TSGNLVR (SEQ ID NO: 1442); if S2 comprises GGA, F2 
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comprises the amino acid sequence QSGHLQR (SEQ ID NO:287); if S2 comprises GGG, F2 
comprises the amino acid sequence RSDHLSR (SEQ ID NO:229); if S2 comprises GGC, F2 
comprises the amino acid sequence DRSHLAR (SEQ ID NO: 1092); if S2 comprises GGT, F2 
comprises the amino acid sequence TSGHLSR (SEQ ID NO: 1201); if S2 comprises GCA, F2 
comprises the amino acid sequence QSGDLTR (SEQ ID NO:220); if S2 comprises GCG, F2 
comprises the amino acid sequence RSDDLQR (SEQ ID NO: 1844); if S2 comprises GCC F2 
comprises the amino acid sequence DRSDLTR (SEQ ID NO:417); if S2 comprises GCT, F2 
comprises the amino acid sequence QSSDLTR (SEQ ID NO: 1450); if S2 comprises GTA, F2 
comprises the amino acid sequence QSGALAR (SEQ ID NO:3339); if S2 comprises GTG, F2 
comprises the amino acid sequence RSDALSR (SEQ ID NO:237); if S2 comprises GTC, F2 
comprises the amino acid sequence DRSALAR (SEQ ID NO: 184); and 

selecting the F3 zinc finger such that it binds to the S3 target subsite, wherein if S3 comprises 
GAA, F3 comprises the amino acid sequence QSGNLAR (SEQ ID NO:801); if S3 comprises GAG, 
F3 comprises the amino acid sequence RSDNLTR (SEQ ID NO:231); if S3 comprises GAC, F3 
comprises the amino acid sequence DRSNLTR (SEQ ID NO:395); if S3 comprises GAT, F3 
comprises the amino acid sequence TSANLSR (SEQ ID NO:377); if S3 comprises GGA, F3 
comprises the amino acid sequence QSGHLQR (SEQ ID NO:287); if S3 comprises GGG, F3 
comprises RSDHLSR (SEQ ID NO:229); if S3 comprises GGT, F3 comprises the amino acid 
sequence TSGHLVR (SEQ ID NO: 1425); if S3 comprises GCA, F3 comprises the amino acid 
sequence QSGDLTR (SEQ ID NO:220); if S3 comprises GCG, F3 comprises the amino acid 
sequence RSDDLTR (SEQ ID NO:188); if S3 comprises GCC, F3 comprises the amino acid 
sequence DRSDLTR (SEQ ID NO:417); if S3 comprises GCT, F3 comprises the amino acid 
sequence QSSDLQR (SEQ ID NO:132); if S3 comprises GTG, F3 comprises RSDALTR (SEQ ID 
NO: 153); and if S3 comprises GTC, F3 comprises the amino acid sequence DRSALAR (SEQ ID 
NO: 184); 

thereby designing a zinc finger protein that binds to a target site. 
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Attached hereto is a marked-up version of the changes made to the specification and claims 
by the current amendment. The attached page is captioned " Version with markings to show 
changes made. " 
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II. REMARKS 



The specification has been amended to insert the sequence identification numbers. Claim 
1 is pending and has been amended herein to move a clause from the first step into the second step 
and to correct a typographical error. Support for the amendment can be found throughout the 
original claim and specification as filed, for example in the Figures. No new matter has been added 
as a result of this amendment and entry thereof is respectfully requested. 



Applicants believe the claim is in condition for allowance and request early notification to 
that effect. 



ROBINS & PASTERNAK LLP 
90 Middlefield Road 
Suite 200 

Menlo Park, CA 94025 
Tel. 650-325-7812 
Fax. 650-325-7823 



III. CONCLUSION 



Respectfully submitted, 





Dahna S. Pasternak 
Registration No. 41,41 1 
Attorney for Applicants 



